Nondescript: A Web Tool to Aid Subversion of Authorship Attribution

نویسندگان

  • Robin Davis
  • ROBIN DAVIS
  • William Sakas
چکیده

2016 ii This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. A person's writing style is uniquely quantifiable and can serve reliably as a biometric. A writer who wishes to remain anonymous can use a number of privacy technologies but can still be identified simply by the words they choose to use — how frequently they use common words like " of, " for instance. Nondescript is a web tool designed first to identify the user's writing style in terms of word frequency from a given writing sample and document, then to suggest how the author can change their document to lessen its probability of being attributed to them. While Nondescript does not guarantee anonymity, the web tool provides a user with an iterative interface to revise their writing and see results of a simulated authorship attribution scenario. Nondescript also provides a synonym-replacement feature, which significantly lowers the probability that a document will be attributed to the original author. v ACKNOWLEDGEMENTS I wish to thank my thesis advisor, William Sakas, for supporting me as I completed this work. I would also like to thank my previous advisor and instructor, Andrew Rosenberg, for encouraging me to pursue my interests relevant to linguistics. Finally, I would like to thank my parents for their unflagging support and love.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Questioned Electronic Documents : Empirical Studies in Authorship Attribution

Forensic analysis of questioned electronic documents is very difficult, because the nature of the documents eliminates many kinds of informative differences. Recent work in authorship attribution demonstrates the practicality of analyzing documents based on authorial style, but the state of the art is confusing. Analyses are difficult to apply, little is known about type or rate of errors, and ...

متن کامل

Automatic Authorship Attribution

In this paper we present an approach to automatic authorship attribution dealing with real-world (or unrestricted) text. Our method is based on the computational analysis of the input text using a text-processing tool. Besides the style markers relevant to the output of this tool we also use analysis-dependent style markers, that is, measures that represent the way in which the text has been pr...

متن کامل

An Overview of the Traditional Authorship Attribution Subtask

This paper describes the Traditional Authorship Attribution subtask of the PAN/CLEF 2012 workshop. As a followup to our subtask at PAN/CLEF 2011 (Amsterdam), we established a new corpus for analysis for 2012 (Rome). The new corpus differed in several ways from the previous subtask: – Both the number and size of documents were decreased – The documents were taken from a different genre (fiction,...

متن کامل

Kharazmi University Scientific Publications and Co-authorship Networks in Web of Science (1994-2020(

Background: The performance and collaboration of universities can be measured through scientific publications and scientometrics indicators. The purpose of this article is to describe the scientific publications situation of Kharazmi University and to discover the important actors of the Co-authorship networks of this university at three levels of researchers, organizations and countries in the...

متن کامل

A Prototype for Authorship Attribution Studies

Despite a century of research, statistical and computational methods for authorship attribution are neither reliable, well-regarded, widely-used, or well-understood. This paper presents a survey of the current state-ofthe-art as well as a framework for uniform and unified development of a tool to apply the state-of-the-art, despite the wide variety of methods and techniques used. The usefulness...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016